Measures for the degree of overlap of gene signatures and applications to TCGA

نویسندگان

  • Xingjie Shi
  • Huangdi Yi
  • Shuangge Ma
چکیده

For cancer and many other complex diseases, a large number of gene signatures have been generated. In this study, we use cancer as an example and note that other diseases can be analyzed in a similar manner. For signatures generated in multiple independent studies on the same cancer type and outcome, and for signatures on different cancer types, it is of interest to evaluate their degree of overlap. Many of the existing studies simply count the number (or percentage) of overlapped genes shared by two signatures. Such an approach has serious limitations. In this study, as a demonstrating example, we consider cancer prognosis data under the Cox model. Lasso, which is representative of a large number of regularization methods, is adopted for generating gene signatures. We examine two families of measures for quantifying the degree of overlap. The first family is based on the Cox-Lasso estimates at the optimal tunings, and the second family is based on estimates across the whole solution paths. Within each family, multiple measures, which describe the overlap from different perspectives, are introduced. The analysis of TCGA (The Cancer Genome Atlas) data on five cancer types shows that the degree of overlap varies across measures, cancer types and types of (epi)genetic measurements. More investigations are needed to better describe and understand the overlaps among gene signatures.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Expression Profiling of Microarray Gene Signatures in Acute and Chronic Myeloid Leukaemia in Human Bone Marrow

Background Classification of cancer subtypes by means of microarray signatures is becoming increasingly difficult to ignore as a potential to transform pathological diagnosis nonetheless, measurement of Indicator genes in routine practice appears to be arduous. In a preceding published study, we utilized real-time PCR measurement of Indicator genes in acute lymphoid leukaemia (ALL) and acute m...

متن کامل

Exploring Gene Signatures in Different Molecular Subtypes of Gastric Cancer (MSS/ TP53+, MSS/TP53-): A Network-based and Machine Learning Approach

Gastric cancer (GC) is one of the leading causes of cancer mortality, worldwide. Molecular understanding of GC’s different subtypes is still dismal and it is necessary to develop new subtype-specific diagnostic and therapeutic approaches. Therefore developing comprehensive research in this area is demanding to have a deeper insight into molecular processes, underlying these subtypes. In this st...

متن کامل

Study of Gene Expression Signatures for the Diagnosis of Pediatric Acute Lymphoblastic Leukemia (ALL) Through Gene Expression Array Analyses

Background: Acute lymphoblastic leukemia (ALL) as the most common malignancy in children is associated with high mortality and significant relapse. Currently, the non-invasive diagnosis of pediatric ALL is a main challenge in the early detection of patients. In the present study, a systems biology approach was used through network-based analysis to identify the key candidate genes related to AL...

متن کامل

کاوش ژنومی نشانه های انتخاب در گاوهای بومی نژاد سرابی و تالشی ایران

The aim of this study was to find the footprint of selection in native Sarabi and Taleshi cattle breeds 296 cattle from two breeds were sampled and genotyped. by 40 k microarray of illumine company. 43 animals were removed because their ACR was below 0.09. Markers were filtered with minor allele frequency (MAF) equal 0.01 and Hardy-Weinberg equilibrium test (10-6). After filtering, 28782 marker...

متن کامل

Fuzzy relations, Possibility theory, Measures of uncertainty, Mathematical modeling.

A central aim of educational research in the area of mathematical modeling and applications is to recognize the attainment level of students at defined states of the modeling process. In this paper, we introduce principles of fuzzy sets theory and possibility theory to describe the process of mathematical modeling in the classroom. The main stages of the modeling process are represented as fuzz...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Briefings in bioinformatics

دوره 16 5  شماره 

صفحات  -

تاریخ انتشار 2015